Averaged gene expressions for regression.

نویسندگان

  • Mee Young Park
  • Trevor Hastie
  • Robert Tibshirani
چکیده

Although averaging is a simple technique, it plays an important role in reducing variance. We use this essential property of averaging in regression of the DNA microarray data, which poses the challenge of having far more features than samples. In this paper, we introduce a two-step procedure that combines (1) hierarchical clustering and (2) Lasso. By averaging the genes within the clusters obtained from hierarchical clustering, we define supergenes and use them to fit regression models, thereby attaining concise interpretation and accuracy. Our methods are supported with theoretical justifications and demonstrated on simulated and real data sets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شناسایی ژن‌های مرتبط با بقا در سرطان کلیه با استفاده از روش مؤلفه‌های اصلی لاسو

Background: Identification of correlated genes with survival by gene expression data is an important application of microarray data. The purpose of this study is to identify correlated genes with survival of conventional renal cell carcinoma (cRCC) patients based on gene expression profiles. Methods: This study is a survival analysis with high dimensional covariates and containing 14814 gene...

متن کامل

Hypothalamic KiSS1/GPR54 Gene Expressions and Luteinizing Hormone Plasma Secretion in Morphine Treated Male Rats

Objective The inhibitory effects of Morphine and the stimulatory influence of kisspeptin signaling have been demonstrated on GnRH/LH release. Hypothalamic kisspeptin is involved in relaying the environmental and metabolic information to reproductive axis. In the present study the role of kisspeptin/GPR54 signaling system was investigated on relaying the inhibitory effects of morphine on LH horm...

متن کامل

Integration and Reduction of Microarray Gene Expressions Using an Information Theory Approach

The DNA microarray is an important technique that allows researchers to analyze many gene expression data in parallel. Although the data can be more significant if they come out of separate experiments, one of the most challenging phases in the microarray context is the integration of separate expression level datasets that have gathered through different techniques. In this paper, we prese...

متن کامل

The Effect of Training Type on Hepatic Gene expressions of Apolipoprotein A‐I, and Apolipoprotein A‐II among Male Wistar Rats

Introdaction: Lipid metabolism disorders, especially raised levels of cholesterol and triglycerides increases the risk of atherosclerosis. This study aimed to investigate the effect of training type including submaximal continuous and high-intensity interval training on hepatic gene expression of Apolipoprotein A‐I, and Apolipoprotein A‐II in male Wistar rats.   Materials & Methods: This exper...

متن کامل

Gene transcriptomic profile in arabidopsis thaliana mediated by radiation-induced bystander effects

Background: The in vivo radiation-induced bystander effects (RIBE) at the developmental, genetic, and epigenetic levels have been well demonstrated using model plant Arabidopsis thaliana (A. thaliana). However, the mechanisms underlying RIBE in plants are not clear, especially lacking a comprehensive knowledge about the genes and biological pathways involved in the RIBE in plants. Materials and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Biostatistics

دوره 8 2  شماره 

صفحات  -

تاریخ انتشار 2007